Cross-modal distillation for RGB-depth person re-identification

نویسندگان

چکیده

Person re-identification is a key challenge for surveillance across multiple sensors. Prompted by the advent of powerful deep learning models visual recognition, and inexpensive RGB-D cameras sensor-rich mobile robotic platforms, e.g. self-driving vehicles, we investigate relatively unexplored problem cross-modal persons between RGB (color) depth images. The considerable divergence in data distributions different sensor modalities introduces additional challenges to typical difficulties like distinct viewpoints, occlusions, pose illumination variation. While some work has investigated infrared, take inspiration from successes transfer object detection tasks. Our main contribution novel method distillation robust person re-identification, which learns shared feature representation space person’s appearance both In addition, propose attention mechanism where gating signal one modality can dynamically activate most discriminant CNN filters other modality. proposed compared conventional approaches cross-domain Results obtained on public BIWI RobotPKU datasets indicate that significantly outperform state-of-the-art up 16.1% mean Average Precision (mAP), demonstrating benefit paradigm. experimental results also using allows improve recognition accuracy considerably with respect relevant approaches. 1 Code: https://github.com/frhf/cross-modal-distillation-reidentification . • Cross-modal training procedure embedding Exploiting intrinsic relation RGB. Ideal needs place Novel gated improves our approach.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supplementary Material for “RGB-Infrared Cross-Modality Person Re-Identification”

This supplementary material accompanies the paper “RGB-Infrared Cross-Modality Person Re-Identification”. It includes more details of Section 4, as well as extra evaluations of our proposed deep zero-padding method. 1. Details of Counting Domain-Specific Nodes In the third paragraph of Section 4.2 in the main manuscript, we quantify the number of domain-specific nodes in the trained network in ...

متن کامل

Cross Dataset Person Re-identification

Until now, most existing researches on person re-identification aim at improving the recognition rate on single dataset setting. The training data and testing data of these methods are form the same source. Although they have obtained high recognition rate in experiments, they usually perform poorly in practical applications. In this paper, we focus on the cross dataset person re-identification...

متن کامل

Hierarchical Cross Network for Person Re-identification

Person re-identification (person re-ID) aims at matching target person(s) grabbed from different and non-overlapping camera views. It plays an important role for public safety and has application in various tasks such as, human retrieval, human tracking, and activity analysis. In this paper, we propose a new network architecture called Hierarchical Cross Network (HCN) to perform person re-ID. I...

متن کامل

Person Depth ReID: Robust Person Re-identification with Commodity Depth Sensors

This work targets person re-identification (ReID) from depth sensors such as Kinect. Since depth is invariant to illumination and less sensitive than color to day-by-day appearance changes, a natural question is whether depth is an effective modality for Person ReID, especially in scenarios where individuals wear different colored clothes or over a period of several months. We explore the use o...

متن کامل

One-Shot Person Re-identification with a Consumer Depth Camera

In this chapter, we propose a comparison between two techniques for oneshot person re-identification from soft biometric cues. One is based upon a descriptor composed of features provided by a skeleton estimation algorithm; the other compares body shapes in terms of whole point clouds. This second approach relies on a novel technique we propose to warp the subject’s point cloud to a standard po...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computer Vision and Image Understanding

سال: 2022

ISSN: ['1090-235X', '1077-3142']

DOI: https://doi.org/10.1016/j.cviu.2021.103352